Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Plant Genome ; 15(2): e20200, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35307964

RESUMO

The ability to accurately quantify the simultaneous effect of multiple genomic loci on multiple traits is now possible due to current and emerging high-throughput genotyping and phenotyping technologies. To date, most efforts to quantify these genotype-to-phenotype relationships have focused on either multi-trait models that test a single marker at a time or multi-locus models that quantify associations with a single trait. Therefore, the purpose of this study was to compare the performance of a multi-trait, multi-locus stepwise (MSTEP) model selection procedure we developed to (a) a commonly used multi-trait single-locus model and (b) a univariate multi-locus model. We used real marker data in maize (Zea mays L.) and soybean (Glycine max L.) to simulate multiple traits controlled by various combinations of pleiotropic and nonpleiotropic quantitative trait nucleotides (QTNs). In general, we found that both multi-trait models outperformed the univariate multi-locus model, especially when analyzing a trait of low heritability. For traits controlled by either a combination of pleiotropic and nonpleiotropic QTNs or a large number of QTNs (i.e., 50), our MSTEP model often outperformed at least one of the two alternative models. When applied to the analysis of two tocochromanol-related traits in maize grain, MSTEP identified the same peak-associated marker that has been reported in a previous study. We therefore conclude that MSTEP is a useful addition to the suite of statistical models that are commonly used to gain insight into the genetic architecture of agronomically important traits.


Assuntos
Estudo de Associação Genômica Ampla , Locos de Características Quantitativas , Estudo de Associação Genômica Ampla/métodos , Fenótipo , Glycine max/genética , Zea mays/genética
2.
Plant Genome ; 13(1): e20009, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-33016627

RESUMO

Successful management and utilization of increasingly large genomic datasets is essential for breeding programs to accelerate cultivar development. To help with this, we developed a Sorghum bicolor Practical Haplotype Graph (PHG) pangenome database that stores haplotypes and variant information. We developed two PHGs in sorghum that were used to identify genome-wide variants for 24 founders of the Chibas sorghum breeding program from 0.01x sequence coverage. The PHG called single nucleotide polymorphisms (SNPs) with 5.9% error at 0.01x coverage-only 3% higher than PHG error when calling SNPs from 8x coverage sequence. Additionally, 207 progenies from the Chibas genomic selection (GS) training population were sequenced and processed through the PHG. Missing genotypes were imputed from PHG parental haplotypes and used for genomic prediction. Mean prediction accuracies with PHG SNP calls range from .57-.73 and are similar to prediction accuracies obtained with genotyping-by-sequencing or targeted amplicon sequencing (rhAmpSeq) markers. This study demonstrates the use of a sorghum PHG to impute SNPs from low-coverage sequence data and shows that the PHG can unify genotype calls across multiple sequencing platforms. By reducing input sequence requirements, the PHG can decrease the cost of genotyping, make GS more feasible, and facilitate larger breeding populations. Our results demonstrate that the PHG is a useful research and breeding tool that maintains variant information from a diverse group of taxa, stores sequence data in a condensed but readily accessible format, unifies genotypes across genotyping platforms, and provides a cost-effective option for genomic selection.


Assuntos
Sorghum , Análise Custo-Benefício , Genoma , Genômica , Haplótipos , Sorghum/genética
3.
Genetics ; 204(1): 99-113, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27412713

RESUMO

Strong directional selection occurred during the domestication of maize from its wild ancestor teosinte, reducing its genetic diversity, particularly at genes controlling domestication-related traits. Nevertheless, variability for some domestication-related traits is maintained in maize. The genetic basis of this could be sequence variation at the same key genes controlling maize-teosinte differentiation (due to lack of fixation or arising as new mutations after domestication), distinct loci with large effects, or polygenic background variation. Previous studies permit annotation of maize genome regions associated with the major differences between maize and teosinte or that exhibit population genetic signals of selection during either domestication or postdomestication improvement. Genome-wide association studies and genetic variance partitioning analyses were performed in two diverse maize inbred line panels to compare the phenotypic effects and variances of sequence polymorphisms in regions involved in domestication and improvement to the rest of the genome. Additive polygenic models explained most of the genotypic variation for domestication-related traits; no large-effect loci were detected for any trait. Most trait variance was associated with background genomic regions lacking previous evidence for involvement in domestication. Improvement sweep regions were associated with more trait variation than expected based on the proportion of the genome they represent. Selection during domestication eliminated large-effect genetic variants that would revert maize toward a teosinte type. Small-effect polygenic variants (enriched in the improvement sweep regions of the genome) are responsible for most of the standing variation for domestication-related traits in maize.


Assuntos
Zea mays/genética , Mapeamento Cromossômico/métodos , Cromossomos de Plantas , Domesticação , Genes de Plantas , Variação Genética , Genética Populacional , Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Genótipo , Modelos Genéticos , Fenótipo , Melhoramento Vegetal/métodos , Polimorfismo Genético , Locos de Características Quantitativas , Seleção Genética
4.
PLoS One ; 9(2): e90346, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-24587335

RESUMO

Genotyping by sequencing (GBS) is a next generation sequencing based method that takes advantage of reduced representation to enable high throughput genotyping of large numbers of individuals at a large number of SNP markers. The relatively straightforward, robust, and cost-effective GBS protocol is currently being applied in numerous species by a large number of researchers. Herein we describe a bioinformatics pipeline, TASSEL-GBS, designed for the efficient processing of raw GBS sequence data into SNP genotypes. The TASSEL-GBS pipeline successfully fulfills the following key design criteria: (1) Ability to run on the modest computing resources that are typically available to small breeding or ecological research programs, including desktop or laptop machines with only 8-16 GB of RAM, (2) Scalability from small to extremely large studies, where hundreds of thousands or even millions of SNPs can be scored in up to 100,000 individuals (e.g., for large breeding programs or genetic surveys), and (3) Applicability in an accelerated breeding context, requiring rapid turnover from tissue collection to genotypes. Although a reference genome is required, the pipeline can also be run with an unfinished "pseudo-reference" consisting of numerous contigs. We describe the TASSEL-GBS pipeline in detail and benchmark it based upon a large scale, species wide analysis in maize (Zea mays), where the average error rate was reduced to 0.0042 through application of population genetic-based SNP filters. Overall, the GBS assay and the TASSEL-GBS pipeline provide robust tools for studying genomic diversity.


Assuntos
Técnicas de Genotipagem/métodos , Software , Zea mays/genética , Animais , Cruzamento , Mineração de Dados , Genética Populacional , Técnicas de Genotipagem/economia , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Polimorfismo de Nucleotídeo Único
5.
Genome Biol ; 14(6): R55, 2013 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-23759205

RESUMO

BACKGROUND: Genotyping by sequencing, a new low-cost, high-throughput sequencing technology was used to genotype 2,815 maize inbred accessions, preserved mostly at the National Plant Germplasm System in the USA. The collection includes inbred lines from breeding programs all over the world. RESULTS: The method produced 681,257 single-nucleotide polymorphism (SNP) markers distributed across the entire genome, with the ability to detect rare alleles at high confidence levels. More than half of the SNPs in the collection are rare. Although most rare alleles have been incorporated into public temperate breeding programs, only a modest amount of the available diversity is present in the commercial germplasm. Analysis of genetic distances shows population stratification, including a small number of large clusters centered on key lines. Nevertheless, an average fixation index of 0.06 indicates moderate differentiation between the three major maize subpopulations. Linkage disequilibrium (LD) decays very rapidly, but the extent of LD is highly dependent on the particular group of germplasm and region of the genome. The utility of these data for performing genome-wide association studies was tested with two simply inherited traits and one complex trait. We identified trait associations at SNPs very close to known candidate genes for kernel color, sweet corn, and flowering time; however, results suggest that more SNPs are needed to better explore the genetic architecture of complex traits. CONCLUSIONS: The genotypic information described here allows this publicly available panel to be exploited by researchers facing the challenges of sustainable agriculture through better knowledge of the nature of genetic diversity.


Assuntos
Cruzamento , Genoma de Planta , Genótipo , Sementes/genética , Zea mays/genética , Alelos , Bancos de Espécimes Biológicos , Mapeamento Cromossômico , Marcadores Genéticos , Sequenciamento de Nucleotídeos em Larga Escala , Desequilíbrio de Ligação , Fenótipo , Polimorfismo de Nucleotídeo Único , Característica Quantitativa Herdável , Sementes/classificação , Estados Unidos
6.
Nucleic Acids Res ; 39(Database issue): D1085-94, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-21076153

RESUMO

Now in its 10th year, the Gramene database (http://www.gramene.org) has grown from its primary focus on rice, the first fully-sequenced grass genome, to become a resource for major model and crop plants including Arabidopsis, Brachypodium, maize, sorghum, poplar and grape in addition to several species of rice. Gramene began with the addition of an Ensembl genome browser and has expanded in the last decade to become a robust resource for plant genomics hosting a wide array of data sets including quantitative trait loci (QTL), metabolic pathways, genetic diversity, genes, proteins, germplasm, literature, ontologies and a fully-structured markers and sequences database integrated with genome browsers and maps from various published studies (genetic, physical, bin, etc.). In addition, Gramene now hosts a variety of web services including a Distributed Annotation Server (DAS), BLAST and a public MySQL database. Twice a year, Gramene releases a major build of the database and makes interim releases to correct errors or to make important updates to software and/or data.


Assuntos
Bases de Dados Genéticas , Genoma de Planta , Plantas/genética , Mapeamento Cromossômico , Genes de Plantas , Variação Genética , Genômica , Redes e Vias Metabólicas , Plantas/metabolismo , Locos de Características Quantitativas , Sintenia
7.
Brief Bioinform ; 10(6): 664-75, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19933212

RESUMO

Mixed models improve the ability to detect phenotype-genotype associations in the presence of population stratification and multiple levels of relatedness in genome-wide association studies (GWAS), but for large data sets the resource consumption becomes impractical. At the same time, the sample size and number of markers used for GWAS is increasing dramatically, resulting in greater statistical power to detect those associations. The use of mixed models with increasingly large data sets depends on the availability of software for analyzing those models. While multiple software packages implement the mixed model method, no single package provides the best combination of fast computation, ability to handle large samples, flexible modeling and ease of use. Key elements of association analysis with mixed models are reviewed, including modeling phenotype-genotype associations using mixed models, population stratification, kinship and its estimation, variance component estimation, use of best linear unbiased predictors or residuals in place of raw phenotype, improving efficiency and software-user interaction. The available software packages are evaluated, and suggestions made for future software development.


Assuntos
Algoritmos , Mapeamento Cromossômico/métodos , Ligação Genética/genética , Genética Populacional , Estudo de Associação Genômica Ampla/métodos , Modelos Genéticos , Software , Simulação por Computador
8.
Int J Plant Genomics ; 2008: 369601, 2008.
Artigo em Inglês | MEDLINE | ID: mdl-18483570

RESUMO

The Generation Challenge programme (GCP) is a global crop research consortium directed toward crop improvement through the application of comparative biology and genetic resources characterization to plant breeding. A key consortium research activity is the development of a GCP crop bioinformatics platform to support GCP research. This platform includes the following: (i) shared, public platform-independent domain models, ontology, and data formats to enable interoperability of data and analysis flows within the platform; (ii) web service and registry technologies to identify, share, and integrate information across diverse, globally dispersed data sources, as well as to access high-performance computational (HPC) facilities for computationally intensive, high-throughput analyses of project data; (iii) platform-specific middleware reference implementations of the domain model integrating a suite of public (largely open-access/-source) databases and software tools into a workbench to facilitate biodiversity analysis, comparative analysis of crop genomic data, and plant breeding decision making.

9.
Nucleic Acids Res ; 36(Database issue): D947-53, 2008 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17984077

RESUMO

Gramene (www.gramene.org) is a curated resource for genetic, genomic and comparative genomics data for the major crop species, including rice, maize, wheat and many other plant (mainly grass) species. Gramene is an open-source project. All data and software are freely downloadable through the ftp site (ftp.gramene.org/pub/gramene) and available for use without restriction. Gramene's core data types include genome assembly and annotations, other DNA/mRNA sequences, genetic and physical maps/markers, genes, quantitative trait loci (QTLs), proteins, ontologies, literature and comparative mappings. Since our last NAR publication 2 years ago, we have updated these data types to include new datasets and new connections among them. Completely new features include rice pathways for functional annotation of rice genes; genetic diversity data from rice, maize and wheat to show genetic variations among different germplasms; large-scale genome comparisons among Oryza sativa and its wild relatives for evolutionary studies; and the creation of orthologous gene sets and phylogenetic trees among rice, Arabidopsis thaliana, maize, poplar and several animal species (for reference purpose). We have significantly improved the web interface in order to provide a more user-friendly browsing experience, including a dropdown navigation menu system, unified web page for markers, genes, QTLs and proteins, and enhanced quick search functions.


Assuntos
Produtos Agrícolas/genética , Bases de Dados Genéticas , Genoma de Planta , Arabidopsis/genética , Mapeamento Cromossômico , Produtos Agrícolas/metabolismo , Marcadores Genéticos , Variação Genética , Genômica , Internet , Oryza/genética , Poaceae/genética , Triticum/genética , Interface Usuário-Computador , Zea mays/genética
10.
Bioinformatics ; 23(19): 2633-5, 2007 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-17586829

RESUMO

Association analyses that exploit the natural diversity of a genome to map at very high resolutions are becoming increasingly important. In most studies, however, researchers must contend with the confounding effects of both population and family structure. TASSEL (Trait Analysis by aSSociation, Evolution and Linkage) implements general linear model and mixed linear model approaches for controlling population and family structure. For result interpretation, the program allows for linkage disequilibrium statistics to be calculated and visualized graphically. Database browsing and data importation is facilitated by integrated middleware. Other features include analyzing insertions/deletions, calculating diversity statistics, integration of phenotypic and genotypic data, imputing missing data and calculating principal components.


Assuntos
Algoritmos , Mapeamento Cromossômico/métodos , Análise Mutacional de DNA/métodos , Variação Genética/genética , Desequilíbrio de Ligação/genética , Locos de Características Quantitativas/genética , Software , Interpretação Estatística de Dados
11.
Nucleic Acids Res ; 34(Database issue): D717-23, 2006 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-16381966

RESUMO

Rice, maize, sorghum, wheat, barley and the other major crop grasses from the family Poaceae (Gramineae) are mankind's most important source of calories and contribute tens of billions of dollars annually to the world economy (FAO 1999, http://www.fao.org; USDA 1997, http://www.usda.gov). Continued improvement of Poaceae crops is necessary in order to continue to feed an ever-growing world population. However, of the major crop grasses, only rice (Oryza sativa), with a compact genome of approximately 400 Mbp, has been sequenced and annotated. The Gramene database (http://www.gramene.org) takes advantage of the known genetic colinearity (synteny) between rice and the major crop plant genomes to provide maize, sorghum, millet, wheat, oat and barley researchers with the benefits of an annotated genome years before their own species are sequenced. Gramene is a one stop portal for finding curated literature, genetic and genomic datasets related to maps, markers, genes, genomes and quantitative trait loci. The addition of several new tools to Gramene has greatly facilitated the potential for comparative analysis among the grasses and contributes to our understanding of the anatomy, development, environmental responses and the factors influencing agronomic performance of cereal crops. Since the last publication on Gramene database by D. H. Ware, P. Jaiswal, J. Ni, I. V. Yap, X. Pan, K. Y. Clark, L. Teytelman, S. C. Schmidt, W. Zhao, K. Chang et al. [(2002), Plant Physiol., 130, 1606-1613], the database has undergone extensive changes that are described in this publication.


Assuntos
Mapeamento Cromossômico , Bases de Dados Genéticas , Grão Comestível/genética , Genoma de Planta , Arabidopsis/genética , Genes de Plantas , Marcadores Genéticos , Genômica , Internet , Oryza/genética , Proteínas de Plantas/genética , Locos de Características Quantitativas , Interface Usuário-Computador , Vocabulário Controlado , Zea mays/genética
12.
Bioinformatics ; 20(16): 2839-40, 2004 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-15105279

RESUMO

UNLABELLED: The goal of this project is to simplify access to genomic diversity and phenotype data, thereby encouraging reuse of this data. The Genomic Diversity and Phenotype Connection (GDPC) accomplishes this by retrieving data from one or more data sources and by allowing researchers to analyze integrated data in a standard format. GDPC is written in JAVA and provides (1) data sources available as web services that transfer XML formatted data via the SOAP protocol; (2) a JAVA API for programmatic access to data sources; and (3) a front-end application that allows users to manage data sources, retrieve data based on filters, sort/group data based on property values and save/open the data as XML files. AVAILABILITY: The source code, compiled code, documentation and GDPC Browser are freely available at: www.maizegenetics.net/gdpc/index.html the current release of GDPC is version 1.0, with updated releases planned for the future. Comments are welcome.


Assuntos
Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Armazenamento e Recuperação da Informação/métodos , Pesquisa , Software , Interface Usuário-Computador , Variação Genética , Disseminação de Informação/métodos , Internet , Fenótipo , Linguagens de Programação , Integração de Sistemas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...